Case Study: How Modeling Revealed Serious Performance Problems in Distributed (DCE) Systems

نویسندگان

A. M. Khandker

T. J. Teorey

چکیده

Open Software Foundation's Distributed Computing Environment (OSF/DCE) [8] is a platform for distributed computing. DCE is a collection of tools and services for the development, use, and maintenance of transparent distributed application systems. The communication paradigm supported by DCE is the synchronous Remote Procedure Call (RPC) [1]. RPCs can be implemented on any transport layer protocol, such as TCP or UDP. RPCs over UDP can be optimized more than those over TCP. Therefore, RPCs over UDP are, in general, faster and hence of our interest in this paper. Fundamental to the overall performance of DCE is the RPC round trip time, also known as latency or response time. Round trip time is the time elapsed between when an RPC is invoked and when it is returned. In this paper, we focus on the round trip time of DCE RPC. Our earlier work describes analytic performance modeling techniques for distributed application systems [4]. Unfortunately, the techniques couldn't be validated because the model-predicted and measured round trip times didn't match. When the model predicted a decrease in the RPC round trip time, the measured round trip time showed an increase. The prediction of the model followed intuition but the actual measurement was counterintuitive. We concluded that a performance bug in the system was causing the round trip time anomaly and investigated the reason. The result of the investigation is described this paper. The objective of this paper is to illustrate how modeling a distributed system can reveal serious performance problems and lead to performance improvement. We start with a background of DCE RPC in Section 2. We develop a queueing network model for RPC in Section 3 and suggest a simple extension to the Mean Value Analysis (MVA) algorithm [6] to account for parallelism present in inter-machine RPCs. We rediscover the anomaly by comparing the modelpredicted round trip times with the measured round trip times in Section 4. Section 4.3 describes the anomaly. We discuss the cause behind the anomaly in Section 4.4 and suggest the x in 4.5. Section 5 describes our conclusions and future work.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Processing Reliability based a Clever Task Allocation Algorithm to Enhance the Performance of Distributed Computing Environment

----------------------------------------------------------------------ABSTRACT----------------------------------------------------------Distributed Computing Environment (DCE) is one the appropriate network for providing the optimal solution in real-time applications problems. The systematic allocation of tasks needs to plays the key role to optimize the overall processing reliability of the DC...

متن کامل

Client/server approach versus distributed object-oriented computing on top of OSF DCE

The OSF Distributed Computing Environment (DCE) has been established as an industry standard for open distributed computing based on the client/server model. Using the case of a distributed workflow approach, it is shown that DCE is a sound technological base, but that an extended higher-level object-oriented system support is better suited for such applications. Based on these observations, ex...

متن کامل

Performance modeling of the distributed computing environment

We develop an analytic performance model for client/server communications using Remote Procedure Call (RPC) with no security over the connectionless datagram protocol in the Open Software Foundation's Distributed Computing Environment (OSF/DCE). It combines features from NetMod, a network performance modeling tool, and the Method of Layers which is a software performance modeling tool. The purp...

متن کامل

Modeling and analysis of the Unix communication subsystems

The performance of host communication subsystems is an important research topic in computer networks.1 Performance metrics such as throughput, delay, and packet loss are important indices to observe the system behavior. Most research in this area is conducted by experimental measurement; far less attention is paid to the analytic modeling approach. The well-known complexity and dynamic nature o...

متن کامل

DISTRIBUTED AND COLLABORATIVE FUZZY MODELING

In this study, we introduce and study a concept of distributed fuzzymodeling. Fuzzy modeling encountered so far is predominantly of a centralizednature by being focused on the use of a single data set. In contrast to this style ofmodeling, the proposed paradigm of distributed and collaborative modeling isconcerned with distributed models which are constructed in a highly collaborativefashion. I...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1999

Case Study: How Modeling Revealed Serious Performance Problems in Distributed (DCE) Systems

نویسندگان

چکیده

منابع مشابه

Processing Reliability based a Clever Task Allocation Algorithm to Enhance the Performance of Distributed Computing Environment

Client/server approach versus distributed object-oriented computing on top of OSF DCE

Performance modeling of the distributed computing environment

Modeling and analysis of the Unix communication subsystems

DISTRIBUTED AND COLLABORATIVE FUZZY MODELING

عنوان ژورنال:

اشتراک گذاری